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Chapter One 


A POISSON SCENARIO* 


Overture . 
The following scenes are sketched to show how the POISSON system can be 
used in the classroom. At its exciting climax, we present a real POISSON run 
and explain the major steps involved. Other sample runs can be seen in Chapter 


Five. 


Act 1 
Curtain rises on a UNC-Greensboro classroom, presided over by Professor 
Knocks, who is speaking to a group of 25 young Carolinians, trying to find 
political relevance in a blank biennium. 
PK: How many of you had a chance to look at today's assignment, pages 
219-249 of Robert Alford's Party and Society, Rand McNally & Company, 
Chicago, 1963? (Professor Knocks is a bit cf a pecant.) 


A forest of hands pop up (remember, this is fiction). 


PK: Well, now what seems to be his main argument? Can you tell us, Mr. 
Striver? 
ERNEST STRIVER: Well, Professor Knocks, Sir, Alford argues that blue 


collar people are more likely to vote for the more liberal party, that 
is the Democrats, and that this difference is essentially constant 
over time. 


Is 


Good work there, Striver, but doesn't he refine his argument further? 


Ie 


As a matter of fact, he does just that, as you said Professor Knocks, 
Sir. He further argues that this difference is not as strong among 
Catholics and not as strong in the South (the hushed silence now 
becomes deafening). He attributes such a specification--as we call 
it in technical lingo--to subcultural differences. 


*This scenario was adapted from the PRIMER of Project IMPRESS at Dartmouth 
College. Hopefully, enough ivy has been cut to produce a Tarheel classroom 
atmosphere. Other chapters in the primer have also borrcwed heavily from the 
Project IMPRESS primer. 


(PK totters over to the blackboard and chalks up the following figures.) 


Difference (Per Cent Democratic among Blue Collar Minus 
Per Cent Democratic among White Collar) 


Religion Region 
Total. Cath. Prot. P-C South Totai-South 


sample 


sample 2 


sample 1 


sample 2 


After a period of discussion, net unlike extrative dentistry, the class 
comes to agree with Striver's interpretation. The cciumn headed "TOTAL" does 
show a fairiy constant occupational difference. The column headed "P-C" shows 
this dirference to be greater among Protestants in all but one case, and the 
column headed "'Total-South" shows the difference te be greater in the total 
population among Southerners (these data, by the way, are for real; but Alford's 
published tables don't make it easy to get two separate figures for South and 


non-South; thus, we compare with the total U.S.). 


PK: Well, now is this still true or have there been changes since 1960? 
After all, that's 12 years ago. 


(The silence now is pierced by sotto voce grumbling that this wasn't in the 


reading assignment. After shock subsides, various tentative ideas are voiced.) 


MR. X: I don't think so. 


MR. Y: Well, there was this Playboy interview... 


MISS 2: I have some Black friends at A and T who say... 
(Professor Knocks waves them all aside.) 


PK: Doesn't anybody have any hard evidence on this gripping question? (A 
hand goes up in the back of the room. Attached to it is Billie Joe 
Williams, a shy lass who had spoken little during the term so far. 
From her unfashionable knée length dress and short-clipped hair, 
Professor Knocks Judges her to be a scholarship student from rural 
North Carolina. ) 


BJW: Ah think POISSON could give us the answer. 

(All male heads are seen to flush jealously.) 

Mr. Xs Goliie! I should have thought of that myself. 

Mr. Y: Wooee!: 

PK: Tell us more, Miss Williams! 

BIW: Well, there is a survey called PRES68 available through the POISSON 
system. It is a national sampling of American adults in Fall, 1968, 
and I'm sure it has questions about voting, region, occupation, and 
religion. 

PKG: I think you have something there, Miss Williams. I see we are getting 
to the end of the hour. Will ye4 please run PRES68 for tomerrow and 


try to find out whether Alford's conclusions are still valid. 


(EXIT all in a dither.) 


The Reserve Reading Room of the library. Billie Joe is discovered pouring 
over a PRES68 codebook. By subtle pantomime it is revealed that she finds 
region to be measured by a POISSON variable called REGIQ@N, religious prefer- 
ence by RELIG, occupation by QCCl, and voting preference by V@TED. She jots 


some notes and exits running. 
Act 3 


The terminal room in the Science Hall: Billie Joe is undaunted by chemis- 


try odors and the other foreign atmosphere. She sits at a keyboard terminal 


(in this case a teletype), follcws the usual sign-on procedures, and generates 
typed cutput. 
Here, we depart from our drama to fcilow the actual run of POISSON. Billie 


Joe's input follows each question mark (7). 


SERVICE? 

. CPS 
2xxx yyy .22z,WiLIITAMS 
GG@OD AFTERNGGN: USER 09; TIME 16:30:50 10/28/71 
7L@AD (POISGN) PUBLIC 

B 2XEQ 

: D@ Y@U WANT USER'S INSTRUCTIONS? 

7NO 


ENTER NAME @F SURVEY 
? PRES68 

1968 ELECTIGN SURVEY -- MICHIGAN SRC 
ENTER NAMES @F 2 T@ 8 VARIABLES TG BE RUN 
(COMMAS) - N@ BLANKS PLEASE. 
?REGIGN, RELIG ,@CC1, V@TED 


i=) 


ENTER GROUPING OPTION 
?SD, EXCEPT ,RELIG,MAP,V@TED ,MAP 


Billie Joe follows the usual sign-on procedure for CPS (conversational 
programming system) in order to use the interactive version of POISSON. 


She asks for the CPS library program, POISON. A question mark follows and 
she requests that POISON be executed (run). Being an old hand, Billie 
Joe by-passes the user's instructions. 


The program asks her to type in the name of the study she wants (PRES68) 
and the names of the variables she will be working with (REGIQN,RELIG, 
QCC1,VQTED). 


The program asks her whether she wants to work with pre-defined category 
groupings within her variables or whether she wants to make her own group- 
ings. Billie Joe tells it that the standard dichotomy for REGIQN (South 
v. All Other) and OCCUPATION (White Collar v. Blue Collar) are fine but 
she wants to make her own category groupings for RELIG and V@TED. 


H@W MANY NEW CATEGORIES? 
13 


GROUP 1? 
7? PROT ,0-4 


GRGUP 22 
2CATH,5 


GROUP 3? 
7QTHNA, 6-13 


ENTER EXCLUSIGN CQ@DE 
cal 


H@W MANY NEW CATEG@RIES? 
25 


GROUP 17 
?HHH,O,1,14,15 


GROUP 2? 
?RMN,2,3,16,17 


GROUP 3? 
?OTHNA ,4-13,18-34 


ENTER EXCLUSIGN CODE 
21 


ENTER C@MMAND 
?7MARG : RELIG, V@TED 


C@MMAND (YES GR N@)? 
7 YES 


"RELIG' LOCATED--RELIGIGUS PREFERENCE 


"V6TED' LOCATED--VGTED ¢@R WGULD HAVE VOTED FGR WH@M IN 68 


Y@UR C@MMAND, VARIABLES, AND @PTI@GNS HAVE BEEN 
ACCEPTED. WOULD Y@U LIKE Té ENTER AN@THER 


2% 


F, 


The MAP option allows Billie Joe te dichctcmize RELIG inte Protestant 
(PROT) and Catholic (CATH) while lumping what responses are to be excluded 
into a third group, QTHNA. She then enters an EXCLUSIGN C@DE of 1 which 
excludes from analysis the group with the highest number (group 3 in this 
case). (An exclusion code of O would exclude no groups, 1 would exciude 
the group with the highest number, 2 would exciude the twe groups with the 
highest numbers, etc.) 


She then goes through the same process fer the variabie, VGIED. 


The program tells Billie Joe that it has her variables all arranged and 
asks for instructions with the request, ENTER C@MMAND. She first asks for 
the marginals (one-variable frequency distributions) for her home-made 
variables, RELIG and VQTED. Since the interactive version cof POISSON is 
built to direct the user in how to punch cards to get the answer, ne answer 
is given her at this point. The pregram asks her if she would like to 

pese other questions of the variables she has chosen. 


ENTER C@MMAND 
?XTAB:@CC1,V@TED;%A 


YOUR C@MMAND, VARIABLES, AND @PTIGNS HAVE BEEN 
ACCEPTED. W@ULD Y@U LIKE T@ ENTER AN@THER 
C@MMAND (YES @R NG)? 

YES 


ENTER C@MMAND 
?XIAB:RELIG,GCC1,V@TED;ZA 


YGUR COMMAND, VARIABLES, AND @PTIGNS HAVE BEEN 
ACCEPTED. W@ULD Y@U LIKE T@ ENTER ANOTHER 
CGMMAND (YES GR NO)? 

“YES 


ENTER C@MMAND 
¢XTAB:REGIGN,@CC1,VGTED;%A 


Y@UR C@MMAND, VARIABLES, AND @PTI@NS HAVE BEEN 
ACCEPTED. W@ULD YOU LIKE T@ ENTER ANOTHER 
COMMAND (YES @R NG)? 

ING 


THE F@LL@WING IS A PRINT-GUT GF YOUR CO@NIRGL CARDS FOR 
A ‘P@ISS@N' RUN. THE FIRST TWO LINES ARE CARD C@LUMNS AND ARE 


NOI T@ BE PUNCHED. 


1 1 2 2 3 3 4 4 6 6 7 
ies, OS Bes CLT O RA aw BRS POip 2 oo Tut Ot 0.0: Moyo oR OG Oo. Dip Pete na Be alae 
ID PRES68 REGIGN,RELIG,@CCi , V@TED 
GROUPING SD RELIG=MAP , V@TED=MAP 
MAP RELIG,1  PR@T,0-4;CATH,5;@THNA,6-13 
MAP V@TED,1 HHH,O,1,14,15;RMN,2,3,16,17;@THNA,4-13, 18-34 


COMMAND MARG:RELIG,V@TED 

C@MMAND XTAB:GCC1,V@TED;%A 
C@MMAND XTAB:RELIG,@CC1,V@TED3%A 
COMMAND XTAB:REGI@N,@CC1,V@TED;ZA 


?7LGGGUT 


TIME 13:58:30 TIME USED: CPU 00:00:01; TERM 00:14:10; 
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PAGE 00:52:313 


Billie Joe then gives three consecutive CGMMANDS to pull out data which 
will give her three entries to Professor Knocks' table for the 1968 
survey. She requests the percentage preferring Humphrey and Nixon among 
White and Blue Collar Workers, first without a control variable, then 
controlling for RELIG, and finally controlling for REGIQGN. No further 
COMMANDS are requested. 


Billie Joe is directed to the exact format for card input to the POISSON 
system. She also finds out she has been on the terminal about 14 minutes 
to get her questions into this format. Now, the "batch" version of 
POISSON is used to obtain the data requested. She punches up her cards 
and submits her runs. Output from the "batch" run follows. 
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The batch output of a POISSON cun first echoes back the input cards. 
These are not repeated here for simpiicity. 
YOUR CGMMAND: MARG:RELIG,V@IED 


MARGINAL FREQUENCIES 


RELIG =  RELIGIGUS PREFEKENCE 
PROT 1167 69.8% 
CATH 348 20.8% 
OTHNS 158 9.4% EXCLUDED 
Sy 
VOTED = V@IED GR WOULD HAVE VOTED FOR whdm IN 68 
HHH 635 37.9% 
RMN 613 36.64% 
@THNA 425 25.44 EXCLUDED 
SAMPLE SIZE = 1673 
Y@UR C@MMAND: XTAB:G@CC1 VOTED ;%Za 
GCCL BY VGTED 
DOWN: RESP@NDENT' 3S MAIN @CCUPATIGN - POL BEA CODE 
ACROSS: V@EED GR WOULD HAVE V@IED FOR WHOM IN 68 
PERCENTAGES ACR@3S 
= So ae 
HHH RMN T@TAL 
WHITE 45.8% 54.2% 423 
BLUE 60.6% 39.3% 407 
TOTAL Sao 7 46.8% 830 
EXECUSI@N ANALYSIS: 
TABLE TOTAL 2 830 
EXCLUDED = 843 
SAMPLE SIZE = 1673 
a 2= 


did 


The first output Billie Joe receives are a table of marginals by which 
she receives some idea of how large her ceil entries are before she 
continues analysis. 


The second CQMMAND produces the percentages preferring Humphrey and Nixon 
ameng White and Blue Collar Workers. Subtracting 45.8 from 60.6, Billie 
Joe has preduced ancther entry to Professor Knozks' table on the blackboard 
under the column T@TAL. This differentiai is the same, she says, as that 
of 1960! 


ah Fe 


YGUR COMMAND: XTAB:RELIG,@CC1,V@TED;%A 
RELIG BY @CC1 BY VOTED 


CGNTR@L: RELIGI@GUS PREFERENCE 
DOWN: RESPGNDENT'S MAIN GCCUPATIGN - POL BEH CGDE 
ACR@SS: VOTED OR WOULD HAVE VGTED FOR WHOM IN 68 


PERCENTAGE ACRGSS 


RELIG = PROT 
3 HHH RMN TOTAL 
WHITE 37.1% 62.8% 277 
BLUE 61.4% 38.5% 306 
TOTAL 49.9% 50.0% 583 
RELIG = CATH 
HHH RMN 

WHITE 61.9% 38.0% 
BLUE 75.2% 44.7% 
TOTAL 59.4% 40.82 
EXECUSI@N ANALYSIS: 
TABLE TOTAL = 764 
EXCLUDED = 909 

=te 1679 


\ ie SIZE 


aié= 


Billie Joe's third command produces a percentage analysis controlling 

for RELIG. She sketches down the differences in order to fill in the 
RELIGIGN column of the blackboard table. A difference of +24 for Protestants 
and -7 for Catholics shows the subcultural differences to be even more 
dramatic than in 1960! 


ahSe 


YOUR C@MMAND:  XTAB:REGIGN,G@CC1,V@TED;%A 


REGION BY @CCi BY VOTED 


CONTROL : RESPGNDENT'S RESIDENCE 
DOWN: RESPONDENT'S MAIN GCCUPATIGN - P@L BEH CODE 
ACRGSS: VG@IED GR WOULD HAVE VOTED FOR WHG@M IN 68 


PERCENTAGES ACRGSS 


REGION = NIS@UT 

HHA RMN T@TAL 
WHITE 44.8% 55. 303 
BLUE D104 44.9% 209 
TGTAL 49.64 50.34 57z 
REGIGN = SGUTH 
WHITE 48.3% 51.6% 120 
BLUE abet 28.2% 138 
TQ@TAL 60.84 59.44 256 


EXCLUSION ANALYSIS: 

TABLE T@TIAL = 830 
EXCLUDED = 843 
SAMPLE SIZE = 1673 


rier 


’ 


M. Billie Joe's final CQYMMAND produces her percentage difference analysis 
with REGIQN as a control. She calculates the percent difference for 
the SQUTH to be 23.4 (the last blackboard table column), makes a quick 
comparison with the percent difference for the total nation (14.5) and 
excitedly runs off to prepare for the next class. 
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Act 4 


Again in the Classroom. 


Mass confusion reigns--a student has tripped over several yards of tele- 
type paper being drzgged aiong; a smail group is huddled around Billie Joe; 
and several students are running around asking questions. Professor Knocks. 


enters and crder is reestablished with his first statement. 


PK: Is Alford still correct on his generai proposition? 


Ten hands are raised! Ernest Striver blurts out: 


ERNEST STRIVER: I got the same answers as Biliie Joe! ‘MIhe general proposi- 
tion of Alford is OK and so are his cbsexvetions on the subcultural 
differences as a result of the reiigicus effect. I don't know how 

< to interpret the regional voting patterns--something is wrong! 

Ernest Striver is aiiowed to enter the calculated percentage differences in the 

blackbcerd table, His results do indéed agree with Billie Joe‘’s and those of a 

few other students. A discussion follows and fair agreement with Aiford‘s 

generai propositicn is reathed. The "religious effect" is more dramatic and 


is also in agreement with Alford's observations. 


PK: Can anyone cifter an interpretacion of the resuits on regional veting 
patterns? 

MR. X: Why wasn't REGION dichotomized into New Engiand and not New England? 

=r: Astute observation! The standard dichotomies for PRES68 were built 


by Yankees. However, you could map your own. Back to the question-- 
on che Southern voting patterns? 


MISS Z: My friends at A and T say there was an amazing increase of Biack votes 
in 1968: 

PK: Ccouid this explain the reversal? 

BoW: It certainly could. Black people tend to be concentrated in Blue 


Coliar jobs and Black people tend to vote Democratic. If their 
voting rates are increasing, this could produce a status effect here 
in the South where a high proportion of the population is Black. 

PKS Is there any way we could test this? 


BIW: I’m certain that RACE is a variable in PRES68. We could go back and 
repeat the analysis with RACE as a control. 


PK: Right on! - ---- - 


=e 


Professor Knocks is cut off by the rush of students in several directions. 
Billie Joe leads a group out the door while several students bombard PK with 


questions as to the iccation of the teletype, science building, etc.! 


ae) SRIRAM — ory ae 
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Chapter Two 


DESCRIPTION OF THE POISSON SYSTEM 


A. General Introduction 


POISSON," Package Of Instructional Social Surveys OF North Carolina, is 
a computer system for selective retrieval and analysis of large data files in 
the social sciences. This system allows searching social science data bases 
for hypothesis testing within a particular study using specific variables. 
Selected combinations of 2 to 8 variables can be analyzed using built-in stan- 
dard dichctomies, standard groupings, or the raw categories of each variable. 
if desired, the variable categories can be "remapped" to user defined 
categories, In the 197i version of POISSON, variable arialysis is limited to 


the discrete mode. 


POISSON is available from the IBM 370 modei 165 computer at the Triangle 
Universities Computation Center, TUCC. Four distinct program packages are 
involved in the system: a tutoriai-interactive package (caiiea POISON) which 
instructs the user in preparing the input data for a run, a batch computational 
package (called POISSON) which analyzes the data and produces the analyses 
requested, a data base to the system, and an interactive program package 
(2XZ2X2 and WORDS) for three-variable analysis of a user's own data. The use of 
any of these packages requires no programming knowledge. POISON, 2X2K2, and 
WORDS are available from the conversational programming system (CPS) at TUCC 
and are written in PL/I. POISSON and ADDER are batch programs also written 


in PL/I. 


The version of POISSON described in this PRIMER was built around two data 
bases--PRES68, the Michigan-SRC 1968 presidential survey, and ETHNO, Murdoch's 
Ethnographic Atlas-Ethnography version. Other data bases are being considered 
for inclusion in the system. PRES68 and ETHNO were selected with the hope of 
making social science instruction more inductive while aiso otfering research 


potential. Additional data bases would enhante the latter purpose. 


“POISSON was built from the IMPRESS system of Dartmouth College. Permission 
for the building of the system, the use of IMPRESS codebsoks, and the modeling 
of this primer after the IMPRESS primer has been sbtained from Edmund Meyers, 
Director, Project IMPRESS. 
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Support of the POISSON system is under the direction of the North Carolina 
Educational Computing Service, NCECS. Codebooks for PRES68 and ETHNO can be 


purchased from NCECS. Documentation and user service are obtainable from NCECS, 


as is assistance in the process of adding a data base: NCECS, P. 0. Box 12175, 
Research Triangle Park, N. C. 27709; phone (919) 549-8291. 


B. The Data Library 


The 1971 version of POISSON has two studies on-line: PRES68 (the 1968 
election survey--Survey Research Center, University of Michigan) and ETHNO 
(ethnology version of Murdoch's.Ethnographic Atias). PRES68 has a total 
unweighted sample N of 1,673 and the POISSON system allows analysis of 109 
variables in the data set (this is a subset of the total number of variables 
availabke--over 500). ETHNO has a total unweighted sample N of 1,168 and a 
subset of 91 variables is available from POISSON. Other data sets are being 
considered for addition to the system and the size of the data sets to be 


added should be no larger than that of PRES68. 


Adding a data set to POISSON necessitates an "inversion" ot the original 
data set from respondent-oriented raw data to a variable-oriented file. The 
system operates on analysis of variables and the codebsocks are built by 
variables, each of which contains frequencies for each category of the vari- 
able. Ancther requirement for adding a data set is the need to build standard 


dichotomies and standard groupings for each variable. 


Data sets available from Dartmouth Coilege’s project IMPRESS are easier 
to add to POISSON than a new data set. IMPRESS data sets include the variable 
subsetting, the formation of standard dichotomies and standard groupings, and 
codebooks giving the frequencies for each category of each wariable. The 
IMPRESS data set itself is not available, however, so that the raw data must 
be collected and inverted to the format given in the IMPRESS codebooks. Data 
sets not available at project IMPRESS require, beyond the inversion process, 
the decision-making for subsetting of available variables and for the forma- 


tion of standard dichotomies and standard groupings. 


To aid in adding a new data set to POISSON, NCECS has made available a 
package called "ADDER". This package of programs inverts the original data 


set to the POISSON variable-oriented format once ail of the necessary decision 
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¢ 


making described in the previous paragraph has been accomplished. Dscumentation 


on "ADDER" is available from NCECS. 


Codebooks are necessary for adequate use of POISSON, especially when a 
user needs to map his variable categories to his own liking. PRES68 and 
ETHNO codebcoks are fortunately available from Dartmouth, with minor changes 
neted in an NCECS addendum tc the codebooks. The production of a codebook 
for a new data set should be considered in the process of adding to POISSON. 


Almost 30 additional codebocks wére available from Darcmouth in 1971. 


C. Access to the System 
The. IBM 370/165 at TUCC is accessible wia several térinel types. 
Keyboard entry to TUCC is necessary for use or POLSON, the interactive version 
of POISSON. Terminals chat have this power include tceletypes, IBM 1050's, 
IBM 2741's, and DATEL's. This same keyboard restviction is necessary to use 
the pregrams 2X2K2 and WORDS, the thrée=-wariablé anaiysis tuterials. It should 
be noted that keyboard entry to TUCC is currently limitea to Low-speed 


terminalis. 


POISSON usage is not limited if no keyboard faciitvies are available. 
The batch system contains the entire system and terminais witnout keyboard 
facilities (such as the IBM 2770 and 2780) can be used for survey analysis. 
Chapter 4 summarizes the format requirements for card entsy co the batch system. 
Bateh-cnly users are forced to learn the formatting problems (Yormidable when 
remapping wariables) through which the interactive systea, POLSON, guides them. 
An instructor with large classes may find the usé of the batch system rinancially 
rewarding. At any rate, the interactive system, with its tutorial approach, 
soen trains a user (after 2 or 3 runs) to go directiy to the batch system without 
using the interactive system again. All users will find this approach financially 


rewarding. 


Accessing the three variable analysis tutorial programs is by keyboard 
terminal only, since both 2K2X2 and WORDS are in the CPS system (interactive 
PL/I). These programs allow three-variable cross-tabulation with analysis by 
gamma and have the advantage of allowing a user to submit his own data for 
analysis (the POISSON system is restricted to mounted data sets). These programs 


are described in Chapter Six. 
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Chapter Three 


A WALK THROUGH THE INTERACTIVE VERSION OF POISSON 


Introduction 


Tne scenario of Chapter One assumed a user experienced witn POISSON. 
While the interactive version of POZSSON supplies a tutorial approach to 
produce experienced users, it is advantageous to documeat this interactive 
system in detail for local trouble-shooting support. Further, the options for 
assistance in formulating a question to a data set are vicn in the interactive 
system and an overview is almost essential. If an interested user has no 
interactive facilities (low speed, keyboard terminsi), he must read this 
chapter since it outlines the logic necessary for analysis. However, it 
is repeated here that the use of POISSON is in no way pteciuded fer users without 
interactive facilities. The 1971 interactive versicn mereiy formulates the 
card format for running the system and doés no analysis itself. 

Once the interactive program is called, as shown in the scenario of 
Chapter One, the program first types "DO YOU WANT USER'S INSTRUCTIONS?" 
These instructions call attention to two leveis cf assistance availiable at 
every stage in which the computer asks fcr input. The user may type in: 

EXPLAIN (for a brief explanation) 

DETAIL (for a longer explanation) 
It is suggested that the beginner type, "YES" for user's instructions and 


"DETAIL" for every initial new request from the computer. 
The skeleton of the interactive wersion of POISSON looks like this: 


Cl ara a eB) eG) 


Choose Choose Choose Choose 
Study Variables Grouping Commands 
Option 


The process will be stepped through from left to right just as the user sees 


it. Each stage is equipped with instructions if the user types in "EXPLAIN" 


or “DETAIL”. 
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(A) Choosing a Study 

When the system types, "ENTER NAME GF SURVEY", it is asking you to type 
in the name of the specific survey with which you wiil be working (cnly one 
survey at a time). 

ENTER NAME OF SURVEY 

?PRES68 
If you just want a list of available survey names, type LIST. This request 
will produce the codenames of aii available surveys on line. After these 


names are given, the previous question is repéated by the computer: 


ENTER NAME @F SURVEY 


This repetition of the question will occur after ali responses such as EXPLAIN, 


DETAIL, LIST, and DESCRIBE (see below) which ask for assistance before answer- 
ing the question itself. An improper speiling sf the survey codename sr the 
typing of a name not availabie will cause an error message and again a vepeat 


of the question. 


(B) Chcosing Variables 
Having located the survey requested, the computer then requests: 


ENTER NAMES QF 2 I 8 VARIABLES T@ BE RUN (C@MMAS)--N@ BLANKS PLEASE 

7 

Every variable in every POISSON survey has a4 specific name (e.g., MARITL, 
REPUBS, R@CKY). They are chosen to be méaningful (ytu can guess that MARITL 
has something ts do with marriage, REPUBS with Republicans, and R@CKY with 
Rockefeller). Sometimes, though, they are cryptic; (would one guess that 
NEGATT refers to the respondent's estimate of the opinicn of Negroes in his 


area regarding desegragation?). 
A proper response to this question would be: 
?MARITL,REPUBS,R@CKY , SGUTH, WHO IGB, WARL 


Note the formatting with respect to commas and blanks. A request for assist- 
ance by a response of EXPLAIN or DETAIL would expose five specific approaches 
to getting more information about the variabie in the survey: 
(1) Type LIST and the system will type out the names of all variables 
in the study. Warning: Up to 109 variable names is a jot of 


time-consuming output. A careful inspection of the codebook could 


=96- 


as) 


(2) 


Ci 


(4) 


(5) 


eliminate this. 
Type DESCRIBE:MARITL,REPUBS,R@CKY and the system will type out a 


capsule (50 characters or less) description of these three variables. 


Type DESCRIBE:MARITL,REPUBS,R@CKY;SD (note the absence of blanks and 
the specific punctuation) and POISON will produce the short descrip- 
tion and category labels with accompanying frequencies for the 


standard dichotomy (SD) of each variable. 


Type DESCRIBE:MARITL,REPUBS,R@CKY;SG and the items of (3) above 


will be produced for the standard groupings (SG) of each variable. 


Type DESCRIBE:MARITL,REPUBS,R@CKY;ALL and POISON will give all of 
(2) and (3) above as well as the total number of categories for the 


raw data of each variable. 


All of these aids in choosing variables will be followed by a repeat of the 


request for the names of the variables to be chosen. 


(C) Choosing the Grouping Option 


The variables having been selected, the next computer-generated request 


will be: 


ENTER GROUPING OPTION 
? 


Typing EXPLAIN or DETAIL will output an explanation of six options available 


to answer this request, all having to do with how the categories of each vari- 


able are to be grouped for analysis: 


(1) 


(2) 


(3) 


IND the grouping of each variable will be entered individually. 
This option is one way to group each variable in a different 
manner (often desirable), these manners being options (3), 


(4), (5), and (6) following. 


DESCRIBE prints a short description of each variable label (handy 
for cryptic labels which make no sense), prints the SD and 
SG categories and frequencies, then asks for one of the 


following grouping options. 


MAP allows user to regroup the raw categories to his own choos- 
ing and therefore rejects the raw, SD, and SG groups of the 


codebook. 


af}e 


(4) 


RAW 


SD 


gives the user access to the categories as they were in the 


original data. 


stands for "Standard Groups"; see codebosks for these built 


in groupings of three or more categaries for each variable. 


stands for "Standard Dichotomy"; choosing this option accepts 


the two codebook SD categories together with a third group 
of leftovers ("DGNT KNGW, NQ ANSWER, AND QTHERS."') 


SG and an answer to the request; 


ENTER GRGUPING @PTIGON 
9 


The inexperienced user will probably feel most comfertabie using SD or 


of SD cx SG will establish the chosen grouping for ail variables previously selected. 


Eventuaily, the user may try te MAP (regroup) the raw categories in otder to test 


his chesen hypothesis. 


exactly as in the following table: 


The use of the MAP option is somewhat ceompiex and is specificaily ilius- 


trated for the variable KIDS of survey PRES68. 


In the cedebook, KIDS appears 


KIDS P, 2A DECK #1 COL. 53 - 53 
ARE THERE ANY CHILDREN UNDER 18 YEARS @LD IN THIS FAMILY? (HOW 
MANY?) 
FREQ. % SD SG IMP @RIG 
LABEL LABEL i Soi ss 
858 51.2 NONE N@NE ) ©) NONE UNDER 18; INAP., 
C@DED 2 IN P.2 
243 1425 KIDS ONE f.: 1 GNE 
242 14.4 KIDS TWO y 2 TWO 
159 9.5 KIDS THREE 3 3 THREE 
72 Gns KIDS 4ORMOR 4 4 FUR 
44 2.6 KIDS 4QORMOR 5 5 FIVE 
26 1.5 KIDS 4ORMGR 6 6 ~ erey 
15 0.8 KIDS 4GRMOR 7 7 SEVEN 
9 0.5 KIDS 4ORMOR 8 8 EIGHT OR MORE 
5 0.2 NA NA 9 9 NA 
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The column headed "SD LABEL" indicates that KIDS has been dichotomized 
into a N@NE group (no kids under 18) and a KIDS group for peopie who have 1 
or more kids under i8. The third group, NA, is a category for those respon- 
dents who did not answer this question. The column headed "SG LABEL" gives 


the same type of information for Standard Groups. 


Assume the user wants to group KIDS into three categories: 

(1) respondents claiming one or two children, 

(2) respondents claiming three or more children, 

(3) respondents not answering plus those claiming no children in the 


family. 
After typing MAP, the POISON system wili respond: 


"KIDS' L@CATED - ANY CHILDREN UNDER 18 IN FAMILY? 
HGW MANY NEW CATEGGRIES? 


7 


Type in the number 3 for the three new groups chosen. The computer will then 
GR@UP 1? 
9 


The answer must be given as a name of the new category (6 characters or less) 
followed by the IMPRESS numbers of the categories to be included in this 
group (from column headed IMP #'S). Separate the name and category numbers 


by commas (no blank) as follows: 


GROUP 1? 
71QR2,1,2 


The computer will then type: 


GROUP 2? 
9 


The second group chosen, 3O@RMORE, would include six categories and can be 


entered as follows: 


GRGUP 2? 
? 3ORMOR , 3-8 


The hyphen saves much typing for larger blocks of categories. As usual, no 


blanks can be used. After the next computer request: 


GRGUP 3? 
9 
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the user must be sure to include all the remaining raw categories for this 
group. Since two categories remain in this example, O and 9, the response 
would be: 


GR@UP 3? 
2N@NENA,0,9 


The computer then types out: 


ENTER EXCLUSION C@DE 

2 
The user is hereby requested to clarify how many of his new categories should 
be included in his tables and calculations. A numerical code is available 


to answer this request: 


QO - for no exclusions. 
1 


- for exclusion of the highest numbered group. 


N 


- for exclusion of the two highest numbered groups. 


(etc.) 


In the sample regrouping, the analyst is assumed to be interesced in the two 
new groupings, 1@R2 and 3@RM@R, and would like to exclude the new grouping, 
N@NENA. The response should be: 

ENTER EXCLUSI@N CODE 

ya 
By typing a 1, the user excludes the highest numbered group, group 3. Note 
that the formation of the new groupings should have this code in mind since 


he can oniy exclude the highest numbered group(s). 


As was mentioned above, a user can give a blanket grouping option to 
cover all of the variables he has chosen. However, the more usuai case will 
require different grouping options for the variables. If the option IND is 
typed, each variable will be treated in turn by the computer. There is an 
alternate method of making exceptions to a blanket option. For example: 

ENTER GROUPING OPTION 

?SG, EXCEPT, EDUC,SD,KIDS,MAP 
Again note the punctuation and the necessary absence of blanks. In this 
example, ali variables will be treated by the "Standard Grouping" option 
except EDUC which will be treated as a "Standard Dichotomy" and KIDS, which 
will be regrouped or "MAP"ed. 
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A final note on the grouping of categories. The order in which new groups 
are formed should be considered in advance if the new groups are to represent 
an ordered grouping. The new group with the highest ordering shouid be entered 


first. Ordering therefore goes from high to iow as tné numbers of the groups 


increase. Remember, HIGHEST GROUP FIRST. 


(D) Choosing Commands 


Having specified the grouping of each variable, the user now enters the 
analytic stage. Two analysis commands are available in the 1971 version of 
POISSON: 


MARG: gives the marginal frequencies for each category of the variables 


requested. 
XTAB: means "cross-tabulation" and gives tables in which each variable 
is run against ali of the other chosen variables. 


In the interactive wersion of POISSON, the computer generates the following 


4 


question once the grcuping options have beén determined: 


ENTER COMMAND 
oe 


° 


A cemmand for analysis ultimately requires the command, the variables to be 
treated, and the statistical option(s) requested for the analysis. The fastest 
way through POISON is to enter ail of this information at once as in the 
following example: 

ENTER CQMMAND 

?XTAB: SEX, EDUC; %A, CHI 
This example requests cross-tabulation of SEX and EDUC with the statistical 
options of ZA and CHI (see below for details on the statistical options). 
Note the necessary punctuation and the absence of blanks. A colon follows 
the command and a semicolon separates the variables from the statistical 


options. 


It is not necessary to supply all this information at once and the 


novice is better off just entering a command as follows: 


ENTER CQMMAND 
?XTAB 


The computer will then enter requests for the variabies and statisticai options 
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in turn: 


ENTER VARIABLES (SEPARATE LIST BY C@MMAS) 
3) 


For three-variable analysis, the order of entry should be: 
CONTROL, INDEPENDENT, DEPENDENT 
For example: 


ENTER VARIABLES (SEPARATE LIST BY C@MMAS) 
?RELIG, OCC1 , V@TED 


In this exemple RELIG is the control variable, @CC1 the independent variable, 
and V@TED the dependent variable. Note that the variables entered must be 
chosen from those specified ac the beginning ef the POISON run. 

Once the variabies are entered, a request is genereted for the statistical 


options desired: 


ENTER GPTIGNS (SEPARATE LIST BY C@MMAS) 


The available opticns (with their code abbreviations) in the 1971 version of 


POISSON are as folicws: 


CHE = Chi Square Statistics 

FREQ - F:cequency Table 

VAN - Percentages Across Table 

“4D ~ Percentages Down Tabie 

*TAB - Percentages of Total Tabie 

N@GTAB - No Tabie 

SELECT - Use One Group of a Variable Only 

REJECT - Reject One Group of a Variable Only 

GAMMA = Gamma Statistic (Yules Q if the table is only 2x2x2) 
EXP - Table of Expected Frequencies 

DELTA - Table of Frequencies minus expected frequencies. 


An example of a response to the request for opticns would be: 


ENTER @PTIGNS (SEPARATE LIST BY C@MMAS) 
?%A, CHI 


Note the absence of blanks and the necessary punctuation. 


Once the command, variables, and statistical cptions have been entered, 


all at once or in steps as has been illustrated immediately above, the analysis 
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stage has been fully formulated. At this point, in the 1971 version of 
POISSON, the interactive version can go no further in producing analysis and 
merely prints out the following message: 


YOUR C@MMAND, VARIABLES, AND @PTI@NS HAVE BEEN ACCEPTED. 
WOULD Y@U LIKE T@ ENTER AN@THER C@MMAND (YES OR NO)? 
2 


This request allows the user to enter up to a total of five commands so that 
he may do several analyses of his variables at one crack. If the answer 
"YES" is typed, he would repeat the procedure for entering a command, the 
variables, and the statistical options after the computer generates the 
requests for this information. A "N@"' answer would produce the following 
message, which is a sample final output of the 1971 interactive version of 
POISSON: 

THE FOLLGWING IS A PRINT-@UT OF YOUR C@NTRGL CARDS FOR 


A 'P@ISSO@N' RUN. THE FIRST TWO LINES ARE CARD C@LUMNS 
AND NOT TO BE PUNCHED. 


1 1 2 2 3 3 4 4 5 5 6 6 7 
8600508 86 One dg 50 aod dablon oo Usmpams wooUon ala goo osuceidc ceO6s 65 as nant!) 


ID PRES68 § REGI@N,RELIG,@CC1,V@TED 
“GROUPING IND REGIGN=SD , RELIG=MAP, 6CC1=SD, V@TED=MAP 

MAP RELIG,1  PROT,O-4;CATH,5;@THNA, 6-13 

MAP VOTED,1  HHH,0,1,14,15;RMN,2,3,16,17;Q@THNA, 4-13, 18-34 


COMMAND MARG:RELIG,V@TED 

COMMAND XTAB:@CC1,V@TED;%A 

C@MMAND XTAB:RELIG,@CC1,V@TED;%A 

COMMAND XTAB:REGI@N,@CC1,V@TED;%ZA 

This output example is the same as the one in the scenario of Chapter One. 
The first two rows of print-out represent punched card column numbers as a 
reference for key-punching. Eight lines of output show the exact format for 
punched cards to do four commands on four variables, two of which variables 


have been regrouped, the other two treated as "Standard Dichotomies". 


"LOGQGUI" procedures then terminate the running of this interactive 
version and the actual calculations are left to the batch system (see Chapter 
Four). The format of the punched cards soon becomes familiar (note the 
importance of columns 1, 11, and 21) so that the user may go directly to the 
batch system after learning what it is all about through the interactive 
version. The sacrifice of immediate feedback of answers in the interactive 


(time-sharing) mode is offeet by the cost effectiveness of using a batch 
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system. Inclusion of all the calculations, data bases, etc., in the 
interactive version is not possible at TUCC at the writing of this primer 


due to storage limitations in the conversational programming system (CPS). 
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Chapter Four 


THE BATCH VERSION OF POISSON 


A. Introduction 


The batch version, POISSON, contains the complete analysis system neces- 
sary to complement the interactive version described in Chapter Three. 
Formatting punched cards (or keyboard entries) for entering a run in the 
batch mode, described in this chapter, is done for the user in POISON. To 
understand the references to the contents of each formatted "card", a know- 
ledge of the process for setting up an analysis, as described in Chapter 


Three, is essential. 


A batch run of POISSON can be described in terms of "control cards". 
Each of the five possible "control cards" represents a line of data on a key- 
board entry terminal or a single punched card. These "control cards" are 
sandwiched in between another set of cards, called "JCL" - Job Control 
Language - cards, the latter being necessary to contact the POISSON system at 
TUCC. "Control cards" are described in section B and 'JCL" cards in section 


C below. 
B. Control Cards for POISSON 
(1) The ID Card: 


Each batch run must contain one ID card and it must be the first control 


card. Its required format is: 


Columns 
1 11 21 
ID survey names of variables to be included in run, separated by 


name commas 


EXAMPLES: ‘(Column numbers are references and are not to be punched.) 


1 A 2 2 3 3 4 4 5 5 6 6 7 
Wes See, Wee) aba wes Oi sig ah wp D> ae 1? Waly yO oe soy 2 dak 06 ieee 
ID PRES68 REGION, RELIG ,@CC1,V@TED 
or 
ID ETHNG TABOOS , TERCUS, SLAVE1 
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(2) The GR@UPING card 


Each batch run must also contain one GR@UPING card and it must be the 


second control card. This card specifies the grouping options for each variable 


used in the run. Two formats are possible for this card--the first being used 
for a blanket grouping option requested for each variable (with exceptions 
noted) and the second for the IND option, in which each variable is grouped 


separately. The format for this control card is: 


Columns 
1 11 21 
GROUPING Grouping Exceptions to the Grouping Option 
option ; 
or 
or 
IND Grouping cptians for each variable 


Examples of the two formats are (cclumni numbers are references and are not to 
be punched): 


i 18 2 2 3 3 4 4 5 5 6 6 7 
J ook soo Riae oom sUlarataae Ken samo om Ors ont ce ool) dma sin cine b ota slp pa, dU) 
GR@UPING SD REL IG=MAP , V@TED=MAP 
or 
GR@UPING IND TABGGS=MAP , TERCUS=SD , SLAVE1=SD 


In the first example cnly the variables which are exceptions to the blanket 
SD grouping options are specified in column 21 onwards. The second example 
calls for the IND option and each variable given on the ID control card must 
be specified for its own grouping option; i.e., TAB@@GS, TERCUS, and SLAVE1 


represent the complete list of variables on the ID card. 


If the user needs more space on the GR@UPING control card, he should 
end his control card at a comma (the end of the grouping option requested 
for any one variable) and start a new card in column 21 continuing his speci- 


fication of grouping options. For example: 


il 1 2 2 3 3 4 4 5 5 6 6 7 
Ge, De aces 6 958 eb aOes ovOe coh ce nebie «cl L. poeta’ Oot soe se MGs iy oro 
GRGUPING IND KIDS=MAP , V@TED=SD ,@CC1=SG ,ROCKY=RAW, 


RFK=SD , ATTHHH=SD , ATTNIX=SD , ATTLBJ=SD 


(3) The MAP card: 


For every variable that uses MAP as a grouping option, a MAP card is 
required which contains the regrouping information necessary for a run (if 
no variables use the MAP option, no MAP card will be used). The MAP cards 
(and the RAW cards described in (4) below) always follow the GROUPING card 
(the order of a series of MAP and RAW cards is immateriai). The required 


format is: 


Columns 

1 11 21 

MAP Name of The new grouping names, each accompanied by appropriate 
Se codebook numbers for the raw categories to be combined. 
Exclusion 
Code 


Example: (punctuation must be precise) 
1 1 2 2 3 3 4 4 5 5 6 6 7 
All fave Dis: og oO aia! sre Digca ool O's) aya ie Doe ope O cpap are Dio oiosiOleve.eveD acre AO v aoe Dre cm vO ees Digs ere) 


MAP VOTED,1  HHH,0,1,14,15;RMN,2,3,16,17;@THNA, 4-13, 18-34 


In this example, the variable V@TED has been regrouped into three new groups: 
HHH (containing the raw categories 0, 1, 14, and 15), RMN (raw categories 2, 
3, 16, and 17), and @THNA (with raw categories 4 through 13 and 18 through 34). 


Note that the exclusion code of 1 is established after the variable name. 


The use of a dash between the extremes of a large group of raw categories 
eases the punching process. Nevertheless, there will be cases in which a 
card (or a line) is not long enough for all the information and continuation 
can be done on a new card or line starting at column 21. The break at the end 


of any one card (or line) must occur after a semicolon. 


Example: 


1 1 2 2 3 3 4 4 5 5 6 6 7 
Dadidae 4408 60sec eaO sas Sea ctOe see tu eOs te 51e 4 dO ss 6c Be ev cOv scans ose 


MAP @ccl,1 PROF ,0-199;MGRS, 200-299; CLERKS , 300-349; 
SALES , 350-399 ; CRAFTS, 400-499 ;GPS , 500-599; 
OTHNA, 600-998 


(4) The Raw card: 


For every variable that uses RAW as a grouping option, one RAW card is 
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necessary. The RAW card(s) contains user-suppiied names of the raw categories of 
the variable requested and the exclusion code fsx these categories. This card can 


take two basic formats as follows: 


Columns 
Al Js). 21 
RAW Variable Names of Raw Categories 
Name oe 
and 
Exclusion Number 
Code 
Examples: 


1 1 2 2 3 3 4 4 5) 5 6 6 7 
il Rerev2)-)!s0s. ave! Olevarere'Diore dose Uic 0X00 Diovcrel e Wave 6) of Diotetetnl Oloral cinid io) clovel cholerae ove cla Vioto ete Dieye.c76 0) 
RAW KIDS ,2 ZER ,@NE, TWO, THREE, FOUR, FIV, SIX, SEV, EIGHT ,@THNA 

cr 
RAW RELIG,1 NUMBER 
The establishment of names for the raw categories is aliowea only when there are 
10 or less raw categories. If there are more than 10 raw categories, they will 
be numbered and the word NUMBER is punched in columns 21 through 26, 
The exciusicn code format is identical to that vor che MAP card. However, 

a default exclusion code is available tor each variable and, if no comma and 
number are entered after the wariable name, this default exclusien code is 


invoked. Fcr exampie: 


1 1 2 2 3 3 4 4 5 5 6 6 7 
Lov cDs cc cOoesedee 2 0Oe0soD000e0o0000000 000 escd00000sccrdo000Ucee0d0e000 


RAW REGIGN N@RTH, SOUTH , EAST, WEST 


(5) The C@MMAND card: 


Each batch run must contain at least one C@MMAND card and can have as many 
of these cards as desired. This card contains the C@MMAND desired (MARG or 
XTAB), the variables to be used in this command (in the following order if 
only three variables--CONIROL, INDEPENDENT, and DEPENDENT wariabies), and 


the statistical options requested for the command. The format of this card 
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is as follows: 


Columns: 

1 11 

COMMAND Type of command, variables, and statistical opticns 
Examples: 


1 1 2 2 3 3 4 4 5 he 5 6 6 a 
LPs SOT AMO Ga toe eis Sie sort Wine, p De sie sess oie tel) aie Shek Ole sels Demeester. 
C@MMAND MARG:RELIG,V@TED 
C@MMAND XTAB:@CC1,VQTED;Z%A 
COMMAND XTAB:RELIG,@CC1,V@TED;%A 
COMMAND XTAB:REGIG@N,@CC1,V@TED;%A,GAMMA 


Ncte the colon between the command and the variables, the semicolon between 
the variables and the statistical options, and the commas between variables 


and between statistical options (when there are more than one). 


C. JCL Cards 


A batch run must tell the computer where to bill you and how to access 
POISSON itself. Five cards are necessary and they are always of the tollowing 
form (note the sandwiching in of the control cards described in section (B) 


above): 


/fjobname J@B xxx.yyy.zzz,name 
// EXEC PG@ISSGN 
/7SYSIN. DB .* 
Control Cards 
/* 
// 


The smail letters in the first card, the J@B card, are user-supplied. The 
"Jobname" is any name of eight characters or less to identify your run. The 
"xxx. yyy.zzz'' is an account code available from whoever manages your terminal. 
Your name, "name", is another user-supplied item. All of the rest remains 
exactly as in the above example. Sandwich in the control cards, get this 


deck (or keyboard lines) into your terminal and you're all set. 
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Chapter Five 


SAMPLE RUNS - BATCH VERSION 


This chapter is devoted to illustrating output from the batch version, 
POISSON, for the statistical options available in the system. Under the 
C@MMAND of XTAB (cross-tabulation), the following statistical options are 


available in the 1971 version of the system: 


CHI Chi Square Statistic 

FREQ Frequency Table 

van Percentages Across Table 
“D Percentages Down Table 
“TAB Percentages-of-Total Table 
NGTAB No Table 


SELECT Use One Group of a Variable Oniy 
REJECT Reject One Group of a Variable Only 


GAMMA Gamma Statistic (Yule's Q for a 2x2x2 table) 
EXP Table of Expected Frequencies 
DELTA Table of Frequencies minus Expected Frequencies 


These options will be illustrated with output from a batch run with a series 


of CGMMANDS run on one set of variables. 


CONTROL CARDS: 


1 dt 2 2 3 3 4 4 5 5 6 6 7 
PFIE oD stay ate Oict.chic oD ee votio Woo) dre Dinia 10 cl Oaloravro  evatene eG ator Die: evere Olevotcte Dio sta oO aveuaveoonewater() 


ID PRES68  §REGIG@N,RELIG,KIDS 
GROUPING IND REGIG@N=SD ,RELIG=MAP , KIDS=SG 
MAP RELIG,1 PROTS,0-4;CATH,5;Q@THER,6-13 


C@MMAND MARG:RELIG 

COMMAND XTAB:RELIG,KIDS;CHI,FREQ,%A,%TAB, GAMMA 
COMMAND XTAB:REGI@N,RELIG,KIDS;EXP,DELTA,GAMMA 
COMMAND XTAB:RELIG,KIDS;FREQ, SELECT (KIDS=THREE) 


shh. 


Three variables from PRES68 are used to illustrate the statistical options: 
REGIQ<N (standard dichotomy), RELIG (MAP), and KIDS (standard grouping). The 
first command illustrates the use of MARG to check the resulting marginals for 
the variable RELIG after this variable was regrouped ("MAP'ed"). XTAB is used 


in the remaining commands to give sample output for the statistical options. 


Commands two and four deal with two-variable anaiysis while command three 
utilizes a ecntrol variable. The REJECT option is not illustrated since its 


format is identical to that of the SELECT option seén in the fourth command. 


Output for this single run with four commands is as foliows: 


ID PRES68 QCC1,REGIQN, RELIG, KIDS 
GROUPING IND $CC1=SD ,REGIGN=SD ,RELIG=MAP , KIDS=SG 
MAP REELC il. PROTS,0-4;CATH,5 ;QTHER, 6-13 


YGUR COMMAND: MARG:RELIG 


MARGINAL FREQUENCIES 


SoocococsclcS S232SSSsSSsece 


RELIG = RELIGIOUS PREFERENCE 
PRGTS 1167 69.8% 

CATH 348 20.8% 

Q@THER 158 9.4% EXCLUDED 
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a 


YQUR CGMMAND: 


RELIG BY 


DGWN: 
ACRGSS:? 


PRG@TS 
CATH 


TOTAL 


PRGTS 
CATH 


TOTAL 


PRGIS 
CATH 


TOTAL 


PROTS 
CATH 


TOTAL 


XTAB: RELIG ,KIDS;CHI ,FREQ,ZA, 4D, 4TAB, GAMMA 


KIDS 


RELIGIGUS PREFERENCE 
NUMBER GF CHILDREN UNDER 18 


FREQUENCY TABLE 


NGNE QNE 
619 dW 
i59 59 
775 225 


PERCENTAGES ACRGSS 


SSS esses SSS SSR 


NGNE ONE 
A 14.8% 
45.84 They ass 
Spee Gy4 14.9% 


PERCENTAGES DOWN 


79.0% 76.4% 
20.4% 23.6% 
778 225 


NGNE . QNE 


40.9% 11.42 
10.5% 3.5% 
51.52 14.9% 


TAG 
173 
43 


216 


Tw@ 
14.8% 
12.4% 


14.32% 


PL oS 


THREE 
8,7% 
10.7% 


9.1% 


73.2% 
26.8% 


138 


THREE 
6.7% 
2.4% 


Oh 


sion 


4ORMGR 
8.64 
15.9% 


16.3% 


64.5% 
35.5% 


155 


4ORMOR 
6.6% 
3.6% 


10.3% 


77.1% 
22.9% 


1512 


TOTAL 
7.1% 
22.9% 


1512 


CHI SQUARE = 18.890 WITH 4 D.F. 
THE PROBABILITY @F CHI SQUARE IS < _.001 


GAMMA 


-0,150 (RELIG =. PROTS ) TENDS WITH (KIDS 


EXCLUSI@N ANALYSIS: 


TABLE TOTAL 
EXCLUDED 
SAMPLE SIZE 


YOUR CQ@MMAN 
REGIG@N BY 
CONTRGLI: 


DOWN: 
ACRQSS: 


REGION = 


PROTS 
CATH 


TOTAL 


PROTS 
CATH 


= 1512 
= i161 


D: XTAB:REGIGN,RELIG, KIDS; EXP ,DELTA,GAMMA 


RELIG BY KIDS 
RESPQNDENT'S RESIDENCE 


RELIGI@US PREFERENCE 
NUMBER QF CHILDREN UNDER 18 


NTSQUT 


EXPECTED CELL FREQUENC 


IES 


NONE QNE TWO THREE 
339 100 95 66 
157 46 44 30 
496 146 139 96 


NONE ONE TWO THREE 

14 af 4 ~2 

-14 3 -4 2 
ctihe 


4ORMOR 
72 
34 


106 


4ORMOR 
-13 
us 


= 4QRM@R) 


TOTAL 
673 
311 


983 


SAMPLE SIZE 


G = -0.127 (RELIG = PROTS) TENDS WITH 


= S@UTH 


NONE @NE TWO THREE 
263 74 72 39 
19 5 5 5} 
282 79 77 42 


NGNE ONE TWO THREE 
3 1 2 -2 
-3 =i -2 2 
G = -0.249 (RELIG = PRGTS ) TENDS WITH 
PARTIAL = -0.,136 (RELIG = PRGTS ) TENDS WITH 


ZERO ORDER = -0.150 


(RELIG = PRGTS ) TENDS WITH 


EXCLUSIG@N ANALYSIS: 
TABLE TOTAL = 1512 


161 
1673 


= 5= 


= 4@RM@R) 


TOTAL 
493 
36 


529 


= 4@RM@R) 


4ORMOR) 


= 4@RM@R) 


YOUR COMMAND: XTAB: RELIG, KIDS 3FREQ, SELECT (KIDS=THREE) 
RELIG BY KIDS 
DOWN: RELIGIOUS PREFERENCE 
ACROSS: NUMBER $F CHILDREN UNDER 18 
FREQUENCY TABLE 

NONE ONE TWO THREE 
PROTS 0 0 0 101 
CATH 0 0 0 37 
TOTAL 0 0 0 138 


EXCLUSION ANALYSIS: 


TABLE TOTAL = 138 
EXCLUDED =) 4535 
SAMPLE SIZE. = 1673 


7 


TOTAL 
101 
37 


138 


Chapter Six 


THREE-VARIABLE ANALYSIS PROGRAMS IN THE 
TUTORIAL-INTERACTIVE MODE 


Two programs dealing with three-variable analysis are availabie in the 
CPS system and thus in the interactive mode, These programs, cailed 2X2X2 and 
WORDS, allow a user to enter his own data for a 2x2x2 ecmmtingensy table and 
analyze this data using the gamma statistic (called Yule’s Q when the 
contingency tabie is limited in size te 2x2x2). Beth programs allow the 
"decomposition" of the gamma relationship between the dependent and independ- 
ent variables in terms of the contrel variable so that the ccntrol variable 
ean be determined as "explaining," having "no effect" on, "suppressing," or 
merely partially affecting, the reletionship between the dependent and inde- 
pendent variabies, The program 2X2X2 performs the calculations and tabulates 
the results while WORDS aids in preparing 4 verbal report of the analysis of 


the relationship. 


The two programs have two purposes distinet from those cf the POISSON 
system. They perform a tutorial tesk in teaching the understanding of gamma 
as used with a 2x2x2 contingency table. Secondly, they provide the user with 
a methed er analysing published results, in both the pepular media and in 
scientific journals, if the variable reported can be easiiy breken into 
standard dichetomies and if the frequencies of the observations are available. 
Specific examples of both purposes are furnished in the fullowing sample runs 


of 2X2X2 and WORDS. 


A. An "Explanation" Variable 


The sociologist Jan Hajda surveyed the réading habits in a probability 
sample of 1,850 women in Baltimore, Maryland. He observed a low negative 
correlation between age (under 45 vs. 45 or older) and bookreading (the number 
of books the respendent claimed to have read in the previous year). Suspecting 
that this correlation may be due to another variable, Hafda drew on education 
as a possibility. The historical analysis will be followed below using 2X2X2, 
WORDS, and Hajda's raw data. This illustration wiil demonstrate the tutorial 
purpose of the programs as well as the research-checking capacity as an educa- 


tionai tool. 


oh? 


2XLX2 


SERVICE? 
A CPS 

7KKM. PVF S25, ane 

GOD AFTERNOON; USER 04; TIME 13:28:29 10/27/71 


2LGaD {2X 2X2) PUBLIC 


#XBQ 
B {D6 YOU WANT USER'S INSTRUCTIONS? ‘Y" QR 'N’ 
(aN 
/ SUPPLY LABELS @R ACCEPT X,Y,4T (‘LABELS' @R ‘X/T') 
?LABELS 
\ N@iW Y@UR LABELS 
|=@R DEPENDENT VARIABLE, Y: 
| ¢READ , HIGH, LOW 
gc < 
| FOR INDEPENDENT VARIABLE, X: 
TAGE, 4502, 44DW 
| FOR TEST VARYABLE, T: 
| ®EDUC ,HIGH,LESS 
DiD ¥@U ENTER Y@UR VARIABLES C(/RRECTLY? 
TYPE ‘Y' If Y@U DID, 'N’ If Y@U WANT T@ KE-DO THEM 
LY 
= 
| ENTER A,B,C,D,E,F,G,H 
\a 
\2215,263,373,453,335,54,133,24 
\ RAW DATA 
/READ/ 
EDUG AGE LOW HIGH 
D Sas a= =sSS== J === == 
HIGH 45UP 215 263 478 
; 44DW 373 453 8 
588 716 
| LESS 45UP 335 54 389 
4aDW 133 24 157 
| —— — —< 
£68 78 
1850 
\ 
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2X2K2 


User diais into the CPS system and is recognized from his account code. 


He loads the program 2XZX2 and cails fer the execution of the program (two 
distinct steps indicated by the initial question marks). The computer asks 
the user if he needs instructions and he answers NG. 


Then 2X2X2 asks for variable labels. Hajda's variables are entered as user 
supplied codes--READ is the cede fer the dependent variable "book reading"-- 
the dichotomy categaties being given as HIGH and L@W; AGE (the independent 
variable} is dichotomized into 45UP (45 and cider) ws 44DW (44 and below); 
and the control variable education (EDUC) is dichotomized into HIGH (high 
school educacisn and above) and LESS (less then full high school education). 
Note that the labei for che higher order of each dichotomy is entered first 
and that the punctuation is critical. The computer ailows a check of these 
label entries and will ask tor them again if the user catches an error in 
his iabeis. 


Hajda’s raw data is entered into the computer. User's instructions, not 
called upon in this run, show the relation between the table entries and 
the A through H items requested. The raw data is entered in one line (they 
could be entered oné at a timey and the computer echoes owt the raw data 
2X2X2 tabie (with marginals). Ir incorrectly entered, 2X2X2 wiil then 
aliow reentry cf the data after the check. 
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IS THIS TABLE CORRECT? ('Y' OR 'N'‘) 

2 ¥ 
ie ¥G@U WANT THE MARGINAL PERCENTAGES? 

2 


MARGINAL PERCENTAGES 


AGE READ EDUC 


NN en 


E PERCENT PERCENT PERCENT 
UPPER LIMIT (.95) 49.1 45.2 72.6 
| SAMPLE VALVE 46.9 42.9 2045 
{| LOWER LIMIT (.95} 44.6 A0.? 68.4 
e 
"(po xu WANT A PERCENTAGE TABLE? 
7¥ 
PERCENT HIGH 
AGE 
# 44DW 45UP 
HIGH 54.8 55.0 
(826) (478) 
EDUC 
LESS 15.3 13.9 
(157) (389) 


~~ 


UPPER LIMIT (.95) s>L92 -.554 815 
SAMPLE VALUE —.240 -.620 BSS) 


DO Y@U WANT ZERG GRDER Q's 
4 

- ZER@ GRDER Q LIMITS AND SIGNIFICANCE 

E Q(AGE,READ) Q(EDUC,AGE) Q(EDUC,READ) 
LOWER LIMIT (.95) -.328 -.687 703 
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Now the analysis, step by step. The computer asks if a MARGINAL PERCENTAGE 
TABLE is desired. An answer of Y produces this table, a relevant considera- 
tion before analysis since the table provides some degree of assurance of 
the validity of analysis based on the size of the dichotomies. The per- 
centages and upper and lower limits indicate a reasonable cell size for 

each dichotomy and the analysis continues. 


The program then asks if a PERCENTAGE TABLE is desired. Again, an answer 
cf Y produces the table. The percentage high on READ is given for EDUC 
vs AGE. A good analyst would quickly note that the percentage high on 
reading is almost identical for both the higher education group and the 
lower education group and that the percentage difference in reading for 
both age groups is about constant. A good indication of the effect of 
education on reading habits regardless of the age group! 


Now the gamma analysis! (Yules Q is the gamma statistic for a 2x2x2 table 
and the programs use Q throughout for this 2x2x2 analysis.) An answer 

of Y produces the zero order Q's--each combination of variables without 

a control. Note the low negative association between AGE and READ 
discovered by Hajda (-.240). Aiso note the strong negative association 
between EDUC and AGE and the strong positive association between EDUC 

and READ. This large spread (with opposite signs) indicates a strong 
possibility of EDUC "explaining" the association between AGE and READ. 
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Dd YOU WANT THE DEC@MP@SITIGN GF Q% 


fon 

| DECGMPGSITIGN GF C 

/ PATR 
u / (AGE,READ) (AGE, EDUC) (EDUC, READ) 
] Q ZER@ PRDER -.240 ~,620 2759 
Q 23RD ITEM TIED 0.000 -.629 S15? 
| WEYSHT 3RD ITEM TIED 498 465 443 
' © 3RD ITEM DTTFERS ~.475 - 640 - Tel 
| WEIGHT 3RD ITEM DiFFERS £502 «ao sSor 
sue MINUS P ~.4/8 0. 005 


THERE ARE N9 SIGNIFICANT (,25) SPECIFICATIONS OF Q: 
DG YT WANT IO SEE THE CONDITIONAL Q'S ('Y' GR 'N‘) 


oY 
Q 3RD ITEM POSITIVE 2004 -.589 of OF 
, 2 WEIGHT 3RD ITEM PQS 462 aU? 2262 
T ")Q 32D iTEM NEGATIVE -.055 -.627 74k 
WEIGHT 3RD ITEM NEG .036 2 387 »182 
CHI-SQUARE SPECIFICATION .172 .164 .167 
SIGNIFICANCE, 1D.F. 10 -10 .10 


em a a ee a ee rr a ar ee OR reed ae Ym ee ee ee me ee ee ee ee cee ee ee ne ae ee ee ee ae OS 


ARE Y@U DONE? 
re 4 
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mm HH HEH He He HE HEH EE & FF 


The user responds affirmatively to a DECQ@MP@SITIGN GF Q. Partial and 
Differential Q's are produced in the subsequent table (along with the 
corresponding weights). Now lcok at Q for AGE vs READ--it disappears 
(0.000) when EDUC is controlled (EDUC tied) and becomes moderately 
negative or stronger for the differential (EDUC differs). Hajda's con- 
clusion that education "EXPLAINS" the relationship between reading and 
age is justified. 


A "specification" analysis is given for the raw data. There are no 
significant specifications. The analysis complete, the user says he is 
done. A novice who didn't understand all the logic in the analysis 
above--especially with respect to the terminology-—-now may want to use 
WORDS to heip him write a report on the above analysis. A run of WORDS 
follows: 
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?LG@AD (WIRDS) PUBLIC 
3 ? XEQ 
WOULD YOU LIKE ME T@ WRITE THE FIRST DRAFT GF YOUR REP@RT? 
TYES 
oe 
PLEASE TYPE IN THE NAMES $F Y@UR INDEPENDENT, DEPENDENT, 
AND TEST VARIABLES. (iF THEY ARE M@RE THAN 10 LETTERS L@NG 
USE ABBREVIATIONS). ; 
) WHAT IS THE NAME OF YOUR DEPENDENT VARIABLE. Y? 
?READ 
K 
WHAT IS THE NAME GF Y@UR INDEPENDENT VARIAELE, X? 
? AGE 
/ AND WHAT TS JHE NAME OF YOUR TEST VARLABLE, “2? 
| ?EDUC 
DG Y@U WANT INPUT INSTRUCTIONS? 
INO 
ENTER A,B,C,D,E,F,G,H 
iiss 265,57 34493,/9599 05,180 026 
DID YU ENTER YOUR DATA CORRECTLY? 
\ YES 
HERE WE GG! 
FIRST, LET'S LOGK AT THE ZER@ @RDER RELATIGNSHIP 
BETWEEN AGE AND READ. 
M 


WE SEE THAT THERE 15 A L@W NEGATIVE ASSGCIATIGN BETWEEN 
AGE AND READ (Q= -0.24). 


N@W LET'S LOOK AT THE ZER@ Q@RDER ASSGCIATIGNS 
INV@LVING EDUC. 
WE SEE THAT THERE IS A VERY STRONG NEGATIVE ASS@CIATION BETWEEN 
EDUC AND AGE (Q= -0.62) AND A VERY STR@NG P@SITIVE ASSGCIATION 
BETWEEN EDUC AND READ (Q= 0.76). 
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L. 


The user loads and asks for the execution of the program, WORDS. He 
answers that he wants help in writing his report. 


As in 2X2X2, he enters.the dependent, independent end test (control) 
variables, this time not naming his dichotomy categories since this program 
merely treats each dichotomy as HIGH vs LQW. 


Again refusing user instructions (he shouid ask for them the first time he 


runs) he enters the same data as he did for 2X2X2. 


The zero order associations are given in a verbal report form. Evidence 
is quickly had as to the possibility of "explanation" of the zero order 
(AGE,READ) relationship due to the opposite signs of the other zero order 
relationships between AGE and EDUC and between KEAD and EDUC. 
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READ WHEN WE CGNIRGL FOR EDUC? TYPE IN THE 
NUMBER NEXT TG YGUR PREDICTIGN. 


IT WILL BECWME LESS NEGATIVE 7 


IT WILL BEC@ME MORE NEGALIVE 8 
IT WON'T BE AFFECYED MUCH ; 9 


WHAT DG YGU THINK? 

Z 

27 

THAT'S RIGHT 

SINCE THE ASSG@CIAIIGNS HAVE OPPOSITE SIGNS, THE 


FROM WHAT WE KN@W ALREADY, WHAT D@ Y@U THINK WILL 
HAPPEN I@ THE ASS@CiATIGN BETWEEN AGE AND 


NEGATIVE ASS@CIATIGN BETWEEN AGE AND 
READ WILL PROBABLY BECG@ME LESS NEGATIVE WHEN 
EDUC IS CONTROLLED. 


| SPECULATION. LEI'S L@CK AI THE PARTIAL. 


WELL, WHAT D@ ¥YGU KNOW! ‘THE PARTIAL Q ASS@CIATIGN BETWEEN 
AGE AND READ C@NIRGLLING FOR EDUC IS 

0.00, NG ASS@CIATIGON. 
‘HUS, EDUC EXPLAINS THE RELATIGNSHIP BETWEEN 
AGE AND READ. THAT I8, WHILZ AGE AND 
READ HAVE A LOW NEGATIVE ASSGCIATICN, LI BECOMES 
NEGLIGIBLE WHEN EDUC IS C@NIR@LLED. 


ean 


IN THE TWO CATEGORIES @F THE TEST VARIABLE, EDUC. 
THEY MIGHT BE RADICALLY DIFFERENT, BUT WE HAVE N@ WAY GF 
PREDICTING FROM THE PATTERN @F ZERG @GRDER ASSGCIATIGNS. 


HERE THEY ARE: AMONG THOSE SCORED PLUS GN EDUC 

THERE IS N@ ASSGCIATIGN BEIWEEN AGE 

AND READ (G= 0.60). ‘AM@NG TH@SE SCORED MINUS 

@N EDUC THERE IS A NEGLIGIBLE NEGATIVE ASS@CIATIGN BETWEEN THEM. 
(Q= -0.06). 


BUT WE AREN'T THRGUGH YET THAT PARTIAL RELATIONSHIP IS A 
SGRT GF AVERAGE GF THE AGE AND READ RELATIG@NSHIFS 


BY MAKING A SIGNIFICANCE TEST F@R THE DIFFERENCE IN THE 

TWO VALUES OF Q, WE CAN TEST WHETHER THE DIFFERENCE BETWEEN 
THE TWé6 ASS@CIATIGNS IS RELIABLE. THE DIFFERENCE FAILS T@ 
REACH THE 0.025 SIGNIFICANCE LEVEL. THUS, WE HAVE N@ FIRM 
EVIDENCE THAT EDUC SPECIFIES THE RELAII@NSHIP BETWEEN AGE 
AND READ AND HAVE ABGUI THE SAME ASS@OCIATIGN IN EACH LEVEL. 


eh ee 
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N. 


O. 


ee 


The computer checks to see if the user understands this early indication 
of the effect of the control by the test variable. He does} 


Then the computer analyses by the "partial Q" and comes up with a concrete 


report result--the EXPLANATION. 


WORDS then checks the possibility of SPECIFICATION by calculating the 
positive and negative associations. A significance test, chi-square, is 
reported as not significant at the 0.025 level. 
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THAT'S IT! ALL Y@U NEED T@ DG NOW IS T@ SCRATCH UP 

S@ME BIBLIGGRAPHIC REFERENCES. H@WEVER, I MADE A FEW ARBITRARY 
DECISI@GNS IN MY ANALYSIS, AND YOU GUGHT T@ L@@K AT THE DATA GN 
YOUR @GWN. 

WHY DON'T Y@U IRY 2X2X2? 
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Q. 


A final pedogogicai note: 
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B. A "Specificatioa' Variable 


A second example cf the use of 2X2X2 and WORDS for undergraduate research 
on published data is briefly presented to illustrate the fiexibility of the 
programs. The Stauffer data from a 1954 national survey (Samuel A. Stauffer, 
"Communism, Conformity, and Civil Liberties," Doubleday, New York, 1955) indi- 
cated a subcultural difference in Southern woting behavior from the general 
behavior in the U.S. This difference is reriected in the preference for 
Eisenhower as compared to party prererence. The new data being available, 


along with dichotomies, one can jump into 2K2X2 and WORDS. 


agi 


SERVICE? 
CPS 
?7ExXx, yyy, 2Zz,name 


GO@D AFTERN@GN; USER 09; TIME 16:30:50 10/28/71; 


2L@AD (2X2X2) PUBLIC 
2XEQ 


DG. Y@U WANT USER'S INSTRUCTIGNS? '‘'Y' @R ‘N’ 

oN 

SUPPLY LABELS @R ACCEPT X,Y,&T (‘LABELS‘ @R 'XYT‘) 
?LABELS 


NOW Y@UR LABELS 
FOR DEPENDENT VARIABLE, Y: 
?PREF, IKE, STEV 


F@R INDEPENDENT VARIABLE, X: 
?REGIGN, SOUTH, @THER 


FOR TEST VARIABLE, T: 
?PARTY, REP, DEM 
DID Y@U ENTER Y@UR VARIABLES C@RRECTLY? 


TYPE 'Y' IF Y@U DID, ‘'N' IF Y@U WANT T@ RE-D@ THEM 


27Y 


ENTER A,B,C,D,E,F,G,H, 
A 
226,231, 26,1061, 505, 327,930,297 


RAW DATA 

/PREF/ 
PARTY REGIGN STEV IKE 
REP SGUTH 26 231 
QGTHER 26 1061 
52 1292 
DEM SGUTH 505 327 
Q@THER 930 297 
1435 624 


IS THIS TABLE C@RRECT? (‘Y’ GR 'N') 
2Y 
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832 
1227 


3403 


The user logs-in, calls 2X2X2, and asks to run the program. 


The labels and data are fed in and 2X2K2 echoes out the contingency table. 
A request for an editorial check brings 4 response that ail is O.K.--the 
dependent variable being PREF (woting preference) with the dichotomy 
categories of IKE and STEV (ror Eisenhower and Stevenson); the independent 
variable REGION with dichotemy categories of SQUTH and @THER; and the test 
variable PARTY with dichetomy categories of REF and DEM. 
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Dé Y@U WANT THE MARGINAL PERCENTAGES? 


ay. 
MARGINAL PERCENTAGES 
REGION PREF PARTY 
PERCENT PERCENT PERCENT 
UPPER LIMIT (.95) 33.6 56.0 4i.L 
SAMPLE VALUE : 32.0 56.3 39.5 
LOWER LIMIT (.95) 30.4 54.6 : 37.9 


DG Y@U WANI 4 PERCENTAGE TABLET 


ry 
PERCENI IRE 
REGION 
OTHER SOUTH 
REP 97.6 89.9 
. (2087) (257) 
PARTY 
DEM 24.2 39:0 2 
(1227) (832) 


B® ¥OU WANT ZERO GRDER Q’S 


2¥ 
ZEKO GKDER Q LIMIIS AND SIGNIFICANCE 
QCREGIGN PREF} QCPARTY,KEGION) Q(PARTY,PREF} 
UPPER LIMIT (.95) -.078 ~.420 975 
SAMPLE VALUE ~,149 ~ 482 ,966 
LOWER LIMIT (.95) =,219 =o 544 6956 


DO YOU WANT THE DEC@MPGSITIGN OF Q? 
aY 


DECOMPOSITION GF Q 
PAIR 
(REGION,PREF) (REGIGN,PARTY) (PARTIY,PREF) 


SC ee ee ce a me ee ee ee ee ee ee oe nce ee ee ee ee ee 
DR cee ee ee ee ae ee a a ee er ee a ee ee eee ee ee ee ee a ee ee ee ee ee 


Q ZERO ORDER -.149 -.482 966 
Q 3RD ITEM TIED 272 - 589 2971 
WEIGHT 3RD ITEM TIED » 389 30h 2593 
Q 3RD ITEM DIFFERS -.417 -.419 958 
WEIGHT 3RD ITEM DIFFERS 611 629 407 


D MINUS P -. 688 ol fi ~ O12 


Ge The MARGINAL PERCENTAGES all fall within the 30-70 percent range making 
analysis feasible. 


D. The requested PERCENTAGES indicate different trends in SGUTH and Q@THER as 
well as in the Republican and Democratic party with respect to voting 
preference for IKE. 


E. Zero order Q's show a low negative association between REGIGN and PREF, a 
moderate negative association between PARTY and REGI@N, and a very strong 
positive association between PARTY and PREF (the latter being not too 
shocking). A decomposition of the Q‘s seems cailed for. 


F. The association between REGIQN and PREF turns moderately positive when 
controlling with tied pairs on the test variable PARTY (partial Q) while 
the differential Q (differing pairs for PARTY) showed moderately nega- 
tive. This result (different signs--large difference) begs looking at 
the specification possibility. What happens in each PARTY? 
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Y@U HAVE A SIGNIFICANI SPECIFICATIGN @F Q: HERE IS THE ST@RY 


Q3RD ITEM P@SITIVE -. 641 -.699 864 
WEIGHT 3RD ITEM PGS 2027 ge 2066 
Q 3RD ITEM NEGATIVE 0339 °296 0984 
WEIGHT 3RD ITEM NEG 0362 2031 0527 
CHI-SQUARE SPECIFICATIGN 107.789 53.230 18.626 
SIGNIFICANCE, 1 D.F. 001 0001 001 
?LGAD (WARDS) PUBLIC 

?7XEQ 


WOULD YG@U LIKE ME T@ WRITE THE FIRST DRAFT GF Y@UR REPQ@RT? 
?YES 


PLEASE TYPE IN THE NAMES QF Y@UR INDEPENDENT, DEPENDENT, 
AND TEST VARIABLES. (IF THEY ARE MORE THAN 10 LETTERS LONG 
USE ABBREVIATIGNS) . 


WHAT IS THE NAME GF Y@UR DEPENDENT VARIABLE, {7 


?PREF 
WHAT IS THE NAME @F YGUR INDEPENDENT VARIABLE, x7 
?REGIGN 
AND WHAT iS THE NAME @F Y@UR TEST VARIABLE, T? 
2? PARTY 
D@ Y@U WANT INPUT INSTRUCTIONS? 
rants) 
ENTER A,B,C,D,E,F,G,H 
A 


226,231, 26,1061,505, 327,930,297 
DID YGU ENTER YOUR DATA CGRKECTLY 
2YES 


HERE WE G@: 


FIRST, LET'S L@@K AT THE ZER@ ORDER RELATIGNSHIP 
BETWEEN REGIGN AND PREF. 


WE SEE THAT THERE IS.A L@W NEGATIVE ASS@CIATIGN BETWEEN 
REGI@N AND PREF (Q= -0.15). 


NOW LET'S LO@K AT THE ZER@ ORDER ASSPCIATIGNS 
INV@LVING PARTY. 
WE SEE THAT THERE IS A MODERATE NEGATIVE ASS@CIATION BETWEEN 
PARTY AND REGI@N (Q= -0.48) AND A VERY STRONG P@SITIVE ASSGCIATIGON 
BETWEEN PARTY AND PREF (Q= 0.97). 
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The specification is significant. Q, for the Republican party only, 
between REGIQ@N and PREF is strongly negative while Q, for the Democratic 
party, between REGIGN and PREF is moderately positive--an even bigger 
association difference than the partial and differential Q's showed. 
There was.a subcultural difference between the South and the rest of the 
nation: 

This is difficult for the novice to put into a report, so WORDS is 
called. 


WORDS is called, the program is requested to run, and the variable names 
and raw data are entered correctly. 


Zero order relationships are calculated and one report—-like conclusion is 
reached: "There is low negative association between REGIQN and PREF 


(Q = -0.15)." The analysis now turns to see what effect PARTY has on this 


association. 


+ 


FR@M WHAT WE KN@W ALREADY, WHAT DO YGU THINK WILL 
HAPPEN 19 THE ASSOCIATIGN BETWEEN REGIGN AND 

PREF WHEN WE CONTRGL FOR PARTY? TYPE IN THE 
NUMBER NEXT T@ YOUR PREDICTIGN. 


IT WILL BECOME LESS NEGATIVE 7 
IT WILL BECOME MORE NEGATIVE 8 


IT WON'T BE AFFECTED MUCH y 


WHAT D@ YOU THINK 

Z 

c7 

THAT'S RIGHT 

SINCE THE ASS@CIATIGNS HAVE GPPGSITE SIGNS, IHE 
NEGATIVE ASSQCYATION BETWEEN REGION AND 

PREF WILL PROBABLY BEC@ME LESS NEGATIVE WHEN 
PARTY IS C@NIRGLLEL . 


ENGUGH SPECULATI@N, LET'S LOGR AI THE PARTIAL. 


WELL, WHAT D@ *@0 KNOW! TAME FARYIAL Q ASSOCIATION BETWEEN 
REGI@N AND FREF CGNIROLLING FOR PakTY Is 
0.27, A L@W POSITIVE ASSOCIATION. 
THE INTR@DUCIIGN GF PARTY INI@ IHE ANALYSIS 
HAS LEAD US T@ C@OMPLEIELY REVISE OUR THINKING ABGUT 
REGION AND PREF, AT THE ZERG GRDER LEVEL, WE 
SAW A L@W NEGATIVE ASSOCYAIION BETWEEN THEM, BUT 
WHEN PARTY IS CONIR@LLED, A LOW POSITIVE 
ASSOCIATION APPEARS. 


BUT WE AREN'I THRYUGH YET THAI PARTIAL RELAIIGNSHIP IS A 
S@RT GF AVERAGE OF THE KEGIGN 4£AND PREF RELATIGNSHIPS 

IN THE TWO CATEGORIES Gf THE IESIT VARIABLE, FARTY. 

THEY MIGHT BE RADICALLY DIFFERENT, BUT WE HAVE NG WAY OF 
PREDICTING FROM THE PATTERN OF ZER@ ORDER ASSPCLATIPONS. 


HERE THEY ARE: AM@GNG TH@SE SCORED PLUS @N PARTY 

THERE IS A VERY STRONG NEGATIVE ASSOCIATION BETWEEN REGIGN 

AND PREF (Q= -0.64). AM@NG IH@SE SCORED MINUS 

@N PARTY THERE IS A M@DERATE POSITIVE ASS@CIATIGN BETWEEN THEM 
(Q= 0.34). ; 


BY MAKING A SIGNIFICANCE TEST #@R THE DIFFERENCE IN THE 
TWO VALUES @F Q, WE CAN TEST WHETHER THE DIFFERENCE BETWEEN 
THE TWé ASSOCIAIIGNS iS RELIABLE 


THE DIFFERENCE IS SIGNIFICANT AT THE .025 LEVEL. 
THUS, @UR SECG@ND MAJ@R CONCLUSIGN IS THAT PARTY 
SPECIFIES THE RELATIONSHIP BETWEEN REGI@N AND 
PREF. THAT IS, THE STRENGTH OF THE RELATIG@NSHIP 
BETWEEN REGI@N AND PREF VARIES WITH THE 

, Saeee OF PARTY. 
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K. 


The user is asked whether the other zero order Q's indicate what controlling 
by PARTY might mean to the (REGIQN, PREF) association. He cerrectly indi- 
cates that the signs of these associations (-.482 and .966) are opposite 

and should make the (REGI@N,PREF) Q less negative. 


‘Analysis goes to the partial and ditterentiai Q's of (REGI@N,PREF) control- 
ling for PARTY. A low positive association (less negative as predicted) 
of 0.27 is discovered. PARTY sure changed things. 


The test of significance of a "specification" of the associaticn by PARTY 

is calculated. A second major conclusicn is reached and a report state— 

ment is generated: "The difference is significant---. That: is, the strength 
of the relationship between REGIQ@N and PREF waries with --- PARTY." A 
subcultural difference is quantitatively expressed. 
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THAT'S IT! ALL Y@U NEED T@ D6 NOW IS TG SCRATCH UP 

S@ME BIBLI@GRAPHIC REFERENCES. H@WEVER, I MADE A FEW ARBITRARY 

DECISI@NS IN MY ANALYSIS, AND Y@U GUGHT T@ LOGK AT THE DATA @N 

YOUR OWN. 

WHY D@N'T Y@U TRY 2X2X2? 

?7LGGOUT 

TIME 16:47:47; TIME USED: CPU 00:60:02; TERM 00:16:56; PAGE 00:46:33 


-?@e 


M. The usual pedagogic departure. 


A better understanding of the logic of this approach can be found in James 
A. Davis, "Elementary Survey Analysis," Prentice-Hall, Engleweced Cliffs, N.J., 
1971. The data for the above two examples were taken from a preliminary draft 
of this book. 
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